Weakly supervised foreground learning for weakly supervised localization and detection
نویسندگان
چکیده
Modern deep learning models require large amounts of accurately annotated data, which is often difficult to satisfy. Hence, weakly supervised tasks, including object localization (WSOL) and detection (WSOD), have recently received attention in the computer vision community. In this paper, we motivate propose foreground (WSFL) task by showing that both WSOL WSOD can be greatly improved if groundtruth masks are available. More importantly, a complete WSFL pipeline with low computational cost, generates pseudo boxes, learns masks, does not need any annotations. With help predicted our model, achieve 74.37% correct accuracy on CUB for WSOL, 55.7% mean average precision VOC07 WSOD, thereby establish new state-of-the-art tasks. Our model also shows excellent transfer ability.
منابع مشابه
Collaborative Learning for Weakly Supervised Object Detection
Weakly supervised object detection has recently received much attention, since it only requires imagelevel labels instead of the bounding-box labels consumed in strongly supervised learning. Nevertheless, the save in labeling expense is usually at the cost of model accuracy. In this paper, we propose a simple but effective weakly supervised collaborative learning framework to resolve this probl...
متن کاملWeakly Supervised Learning for Salient Object Detection
Recent advances of supervised salient object detection models demonstrate significant performance on benchmark datasets. Training such models, however, requires expensive pixel-wise annotations of salient objects. Moreover, many existing salient object detection models assume that at least a salient object exists in the input image. Such an impractical assumption leads to less appealing salienc...
متن کاملWeakly Supervised Action Detection
Detection of human action in videos has many applications such as video surveillance and content based video retrieval. Actions can be considered as spatio-temporal objects corresponding to spatio-temporal volumes in a video. The problem of action detection can thus be solved similarly to object detection in 2D images [3] where typically an object classifier is trained using positive and negati...
متن کاملWeakly Supervised Learning of Foreground-Background Segmentation Using Masked RBMs
We propose an extension of the Restricted Boltzmann Machine (RBM) that allows the joint shape and appearance of foreground objects in cluttered images to be modeled independently of the background. We present a learning scheme that learns this representation directly from cluttered images with only very weak supervision. The model generates plausible samples and performs foreground-background s...
متن کاملWeakly-supervised Dictionary Learning
We present a probabilistic modeling and inference framework for discriminative analysis dictionary learning under a weak supervision setting. Dictionary learning approaches have been widely used for tasks such as low-level signal denoising and restoration as well as high-level classification tasks, which can be applied to audio and image analysis. Synthesis dictionary learning aims at jointly l...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Pattern Recognition
سال: 2023
ISSN: ['1873-5142', '0031-3203']
DOI: https://doi.org/10.1016/j.patcog.2022.109279